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Abstract 

We continue our study [2] of the distribution of the maximal number of offsprings 
amongst all individuals in a critical Galton- Watson process started with k ancestors, 
treating the case when the reproduction law has a regularly varying tail F with index —a 
for a > 2 (and hence finite variance). We show that X£ suitably normalized converges 
in distribution to a Frechet law with shape parameter a/2; this contrasts sharply with 
the case 1 < a < 2 when the variance is infinite. More generally, we obtain a weak limit 
theorem for the offspring sequence ranked in the decreasing order, in terms of atoms of a 
certain doubly stochastic Poisson measure. 
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1 Introduction and main results 

Let p = (p(n)) n eN be a probability measure on the set of nonnegative integers; we view p as 
the law of a random variable X which gives the number of children of a typical individual. We 
always assume criticality, i.e. 

oo 

E(X) = $>p(n) = l, 

n=0 

and implicitly exclude the degenerate case p = 8i. We consider a Galton- Watson process having 
reproduction law p. When the latter starts from k > 1 ancestors, we write Tjt for the size of 
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the total population, and then for i = 1, . . . , Tk, we denote by X{ the number of children of the 
i-ih individual, where individuals in the branching process are enumerated by some arbitrary 
procedure (e.g. breadth first search). This note is concerned with weak limit theorems as 
k — > oo for the maximal offspring 

X* k = max{Xi :l<i<T k }, 

and completes results of a previous work [2J which we now briefly recall. Plainly, X% can be 
viewed as the maximum of k i.i.d. copies of X*, and one should naturally expect extreme value 
distributions to appear in the limit. 

Let F(n) = Yli^n+i P(0 denote the tail distribution function of p, and suppose that 

F(n) = rT a i{n) (1) 

where i is a slowly varying function and a > 1. The main result in [2j is that for 1 < a < 2, 
k~ l X k converges in law as k — > oo to a Frechet distribution with shape parameter 1. It is 
rather surprising that this weak limit theorem depends on a only through the scale parameter 
of the Frechet law; in particular in the normalization of X% is independent of a and the slowly 
varying function £ plays no role at all. The assumption 1 < a < 2 implies that the variance 

oo 

a 2 = S ^^n 2 j){n) — 1 

n=0 

of the reproduction law is infinite, and it is therefore natural to wonder whether a similar 
phenomenon also occurs in the case a > 2 for which the variance a 2 is finite. The approach used 
in [2] for 1 < a < 2 relies on the connexion popularized by Harris between branching processes 
and left-continuous random walks, stable limit theorems for the latter, and the structure of 
jumps of stable Levy processes. It does not extend to the case a > 2 since Brownian motion 
then arises in the limit and the jump structure becomes degenerate. 

In order to state the first result of the present contribution, it is convenient to introduce an 
asymptotic inverse of the tail distribution F, that is a function if : (0, 1) — > R + such that 

F(ip(e)) ~ £ as e -> + . 

It is well-known that such a function exists when (CD) holds; further (p then varies regularly at 
0+ with index — 1/a and can be chosen nonincreasing. 
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Theorem 1 Assume (CQ) with a > 2. Then for every x > 0, we have 



lim F(X* k < x^(k-' 2 )) = exp I -— aT a/2 

k— i>oo \ (J 

Theorem [1] contrasts sharply with the results of [2j; this points at the following transition 
for critical branching processes with a reproduction law fulfilling ([!]) and started from k ^> 1 
ancestors. For 1 < a < 2, the maximal offspring is always of order k, while for a > 2 it is 
only /c 2 / Q . More precisely, the maximal offsprings for two such branching processes may have 
much different asymptotic behaviors even when the two reproduction laws have the same finite 
variance, whereas the asymptotic behaviors are always essentially the same when both variances 
are infinite. 

It is well-known and easy to prove that when the reproduction law p has finite variance a 2 , 
the Laplace transform of T&, the total population generated by k ancestors fulfills 

lim E(exp(— ak~ 2 Tk)) = exp (— \p2ajo) , a > 0, 

or equivalently that k~ 2 T k converges weakly towards some stable(l/2) variable r. Comparing 
with Theorem [Q this suggest that it should be interesting to also take into account the role of 
Tk in the study of maximal offsprings. 

In this direction, introduce a sequence (^i)ieN of i.i.d. variables with law p, which we suppose 
further independent of the branching process, and hence of Tk. So (Xi)^ and (?7i)i G N have 
the same law, but plainly the partial sequences (Xj)j<j- fc and {f]i)i<T k do not have the same 
distribution since (Xj) ig N and Tk are not independent (and more precisely there is the identity 
Tk — Xi + . . . + Xx k + 1)- Nonetheless, it is easy to check that 

rf k = max{r]i :l <i <T k } . 

has the same asymptotical behavior as X^ as k — > oo, namely 



lim f(r]l < xip{k 2 )) = exp I x 

k^oo \ (J 



■a/2 



This reflects the fact that when the reproduction law fulfills (OQ) with a > 2, and the number 
of ancestors is large, the most prolific individual in a branching process has a negligible impact 
on the whole process. It is interesting to stress that this phenomenon ceases for 1 < a < 2. 
More precisely, in the case 1 < a < 2, even though the maximal offspring X£ is order k and 
thus much smaller than the total population Tk which has order k a , k -1 ^ and fc _1 X^ do not 
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have the same asymptotic distribution; see Comment 2 in [2] . 

Our second result extends Theorem [1] by considering more generally not just the largest off- 
spring, but also the second, third, ... largest, jointly with the total population size. Specifically, 
we write 

A M - A fc > A fc,2 > A /c,3 ^ 

for the ordered sequence of the offspring numbers {Xj : 1 < i < T k }. 
Theorem 2 Assume (CD) wii/i a > 2. T/ien 

converges in the sense of finite- dimensional distributions as k —t oo towards (t; £i, £2, • ■ 
where r is a positive stable(l/2) variable with law 

a ( a 2 \ 
. exp U , t > , 

and £1 > £2 > . . . denotes the sequence of the atoms ranked in the decreasing order of a random 
measure M on (0, 00) such that conditionally on r = t, M is Poisson with intensity atx~ a ~ l x. 

In particular, we see that the sequence of offspring numbers properly normalized and ranked in 
the decreasing order converges weakly towards the ranked sequence of the atoms of a doubly 
stochastic Poisson measure M (also known as a Cox process). 

The rest of this work is organized as follows. Proofs of the two theorems are given in the next 
section, relying on estimates for Laplace transforms of linear functionals of offspring variables. 
We then gather in Section 3 a number of comments and observations, including some pointers 
at related literature. 

2 Proofs 

Our approach for establishing Theorems [1] and [2] relies on standard techniques for estimating 
Laplace transforms of linear functionals of the empirical measure of offsprings. One could 
also prove Theorem [2] by first rephrasing it in terms of random walks and then using more 
sophisticated tools of weak convergence of cadlag semimartingales in Skorohod's space; however 
this would require knowing a priori the correct orders of magnitudes and also identifying a priori 
dominant contributions and their relations. Therefore we prefer to develop arguments which 
are both more elementary and natural. 
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We also point out that Theorem [T] can be recovered from Theorem [2] and the following 
elementary calculation. Taking Theorem [2] for granted, we have for every x > 



lim Ppr;* t < ip(k~ 2 )x) = P(£i < x) 

= F(M{(x, oo)) = 0) 
= E(exp(-aT a r)) 

/ V2 - a/2 

= exp x 1 

V a 

However it would probably not have been easy to guess a priori the correct renormalization of 
the largest offsprings without knowing beforehand Theorem [TJ 

Our starting point is based on the following elementary feature. Consider a function / : 

N ->■ [0, oo]; we set 

H(f)=jrf(x i ). 

i=i 

and aim at evaluating the Laplace transform 

£(/) = E(exp(-tf(/))) . 
In this direction, we observe from the branching property that £(/) solves the equation 

oo 

s = Y,P(n)e- f(n) s n , se [0,1]. (2) 

n=0 

Since the entire function 

oo 
n=0 

has derivative g'* < 1 on [0, 1] (recall that p is critical) and < <?/(0) < (7/(1) < 1, © has a 
unique solution and thus determines £(/). 

2.1 Proof of Theorem [I] 

The heart of the proof of Theorem [TJ lies in the following lemma. At this stage, we do not 
require to hold. 

Lemma 1 Let p be a critical reproduction law with finite variance a 2 . Suppose that its tail 
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distribution has F(x) > for all x > and that 



r v F ^ n 

lim hm sup - = (J . 

y^oo x^ac F(X) 



Then 



F(X* > x) ~ J2F(x)/a 2 as x -> oo. 



Remark. Note that X{ may have infinite expectation; this occurs for instance when F(x) 
(xlnx) -2 as x — > oo. 



Proof: Let x > and define f(n) = for n < x and = oo for n > x. So 

H(f) 



if X{ < x 
oo if X{ > x 



and if we set C(f) := g(x), then g(x) = F(Xf < x) is the distribution function of X*. In 
particular, ([2]) shows that g(x) is the unique solution to the equation 

q(x)= J]p(n)^) n , 0(2;) e [0,1]. (3) 
Now introduce the generating function g of p, 

oo 

^) = ^p(n) S ", se [0,1]- 

n=0 

Since p is critical with variance a 2 , we have 

g(s) = s + V(l - s) 2 + o((l - s) 2 ) , s 1 - . 
Combining with (J3j), we obtain the estimate as a; — >■ oo 



l -a 2 g{xf + o(^) 2 ) = 5>(n)<?(*r = E ^f^^ ' ^ 

n=l ^ ' 



where £> = 1 — g is the tail distribution of X±. We now claim that 

V , Wj g(x) n+3! ~ 1 as x ^ oo, 

n=l v ' 

which will complete the proof of the statement, thanks to (j3J). 
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Indeed, on the one hand, since (p(x + n)/F(x),n > 1) is a probability measure and g(x) < 1, 
there is the obvious upper bound 

n=l ^ ' 

On the other hand, for every fixed y > 0, we have 

£ %t% ^ n+x > F(x) ~fff +y)) e( X )*™ . (6) 

n=l 

We now see from (J4]) and (jSJ) that 

< \J 2F(x)/cr 2 + o(g(x)) . 

Further, F(x) = o(x~ 2 ) because p has a finite variance. Hence g(x) = o(l/x), which in turn 
entails that 

lim g(x) {l+y)x = 1 . 

X — YOQ 

Plugging this in fl5j), we get that 

hmmf \ v g{x) n+x > hmmf — — ^ , 

x-^oo ^ — ' r (x) x^oo r (x) 

n=l v ' v ' 

and since y can be chosen arbitrarily large, we conclude from the assumption of the lemma that 

limiM^^±^^(x)>l, 

x-»on ' « Hit) 



.,— .. X J " — * /'(./■ 

n=l v 



which ends the proof. □ 

If we now assume that ([1]) holds with a > 2, then the conditions of Lemma [1] are fulfilled. 
Theorem [1] then follows from the fact that, because different ancestors produce i.i.d. branching 
processes, can be viewed as the maximum of k i.i.d. copies of X*, the estimate of Lemma 
[Hand the classical result of Gnedenko (see, for instance Proposition 1.11 in [ID])- 
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2.2 Proof of Theorem [2] 



We consider a continuous function / : M + — > IR + with / = on some neighborhood of 0, and 
a > 0. Recall that ip is an asymptotic inverse of F. For every k > 1, we write 

/*,«(*) = aA;~ 2 + f{x/^(k~ 2 )) . 
The key step consists in estimating 

= 1 - C(f k>a ) = 1 

as — 7- oo. 

Lemma 2 Assume with a > 2. In the notation above, we have 



lim kC(f k ,a) = 

k— >oo 




k — > OO. 



Proof: For the sake of simplicity, we assume that / = on [0,1], the general case only 
requiring slightly heavier notation. The calculations are closely related to those in the proof of 
Lemma dj and we shall therefore sometime provide fewer details. We first recall from (j2J) that 
there is the identity 

OO 

£(/*,«) = $>M exp(-aA;~ 2 - f{n/ V {k' 2 )))C{f Ka ) n , 

n=0 

and then, using the expansion of the generating function g, we get that 

-a 2 C{f k ,a) 2 + o(C(f k , a ) 2 ) = 5>(n) (1 - exp(-aA;- 2 - /(n/^- 2 )))) C(f k , a ) n • (7) 

n=0 

We first use that / = on [0, 1] and C(fk, a ) < lj an d get the upper bounds 

l -a 2 C{f Ka f + o{C{f kA ) 2 ) < P(n) (l - e~ ak - 2 ) + ^ p(n) 

n<ip(k~ 2 ) n>tp(k~ 2 ) 

< ak~ 2 + F( V (k- 2 )) . 



Ti 



E 



exp -ak 2 Ti - ^ f( x i/<p(k~ 




8 



Since F(ip(k 2 )) ~ k 2 , we conclude that C(fk, a ) = 0(l/k). Recall that ip is regularly varying 
at 0+ with index —1/a > —1/2, in particular (p(k~ 2 ) = o(k) and therefore 

lim£(/ fcia )^~ 2 ) = l. (8) 

k— >oo 

We need to estimate the right hand side of ((7j) as k — > oo; in this direction it is convenient 
to decompose the sum depending on whether n < (p(k~ 2 ) or n > ip(k~ 2 ), as the summand has 
different asymptotic behaviors on these two regions. So first, as / = on [0, 1], we deduce from 
(JED that 

V{n) (1 - exp(-ak- 2 - /(nM*- a )))) C(f k , a ) n 

n<ip(k~ 2 ) 

~ Yl p( n ) ( x - e ~ ak ~ 2 ) ~ ak ~ 2 ■ 

n<ip(k~ 2 ) 



Second, the assumption (prj ensures that for every bounded and continuous function h : 
[1, oo) — > R, one has 



POO 

P(n)h(n/ V (k- 2 )) ~ F^{k- 2 ))a / h( 



— rv — 1 
X)X X . 



n>ip(k~ 2 ) 

Since C(fk, a ) < 1 and F(ip{k~ 2 )) ~ A; -2 , we then readily deduce from (jSJ) that 

(l-e-^))^" 1 ?. 

These two estimates combined with ((7|) complete the proof of the lemma. □ 

We can now tackle the proof of Theorem [2j Because the branching processes generated by 
distinct ancestors are independent, we have 

E ^exp \^ak~ 2 T k - f(X t /^k~ 2 )) J J = C(f k , a ) k , 

and it follows from Lemma [2] that 

hm E ( exp \ -ak~ 2 T k - ^ f(Xi/(p(k~ 2 )) 



n=l 



exp 1^-^ (^a + a J™ (1 - e~ f{x) ) x-^y \. (9) 
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We next identify the right hand side of ([9]) as the joint Laplace transform of (r, (M, /)) 
where r is a random variable and M a random point measure on (0, oo] whose joint law is 
specified in Theorem [2j Indeed, we have then, first, 



E(exp(— ar)) = exp(— ^2a/<7 2 ) , 

second, 

E(e" (M ' /) | t = t) = exp | -to jT°(l - e-^aT"- 1 ?! , 

and finally 

E(exp(— ar — (M, /})) = exp i — ( a + a / (l - e" /w ) x" "^ ^ 

Next introduce for each k > 1 the empirical measure of rescaled offsprings when the branch- 
ing process has k ancestors 

1=1 

We ignore the atom of M' k at to get a random point measure on (0, oo], and rephrase (J9]) 
as 

lim E (exp {-afc- 2 T fc - (M k , /)}) = E(exp(-ar - (M, /))) 

for all a > and continuous functions / : (0, oo] — > M + which have compact support. According 
to Theorem 14.16 in [5], this ensures that the pair (k~ 2 Tk,Mk) converges weakly as k — > oo 
towards (r, M), where the space of Radon measures on (0, oo] is endowed with the vague 
topology. This entails Theorem |2j 



3 Miscellaneous remarks 

1. The literature surveyed by Yanev [T2l [13] contains a variety of results about extremes for 
individuals in branching processes. In particular, Rahinov and Yanev [9] have characterized the 
asymptotic behavior as n — > oo of the maximal offspring at the n-th generation conditionally on 
the event that the extinction has not yet occurred; previously Pakes [7] considered asymptotics 
of the largest score up to and including the n-th generation, again conditionally on the event 
that extinction does not occur before the n-th generation. In a different but related direction, 
namely the distribution of the maximum of the branching process itself, we refer to [U [31 16| ITT]. 

2. If one replaces the assumption ([T|) in Theorem [1] by the condition that the tail distribution 
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of the reproduction law decays exponentially, say F(x) ~ ae~ bx for some a, b > 0, then the same 
argument shows that X k — 2b~ l In k now converges weakly as k — > oo to a Gumbel distribution. 
In the same way, the analogue of Theorem [2] can be stated as follows: As k — > oo, 

(k~ 2 T k ; Xl A - 2b- 1 In k, X* k 2 - 2b' 1 In k , . . . , X* kTk - 2b' 1 In k) 

converges in the sense of finite-dimensional distribution towards (r; x±, X2, ■ ■ • ), where r has the 
law 

a fa 2 



exp - — t , t > , 



V2^P V 2t. 

and conditionally on r = t, x\ > x<i > . . . denotes the sequence of the atoms ranked in the 
decreasing order of a Poisson measure on (— oo, oo) with intensity tabe~ bx x. 

3. We point out that Equation ([2]) has a natural interpretation in terms of branching 
processes. In this direction, we may view p/ = (pf(n) = p(n)e~^ n ^) n eN as a probability 
measure on N U {oo} by setting p/(oo) = 1 — Z~2^LoP( n ) e ^ n ^ ■ Then 

oo 

J2p(n)e- f{n) s n , 0<s<l 

n=0 

should be thought of as the generating function of p/, and if we consider a Galton- Watson 
branching process with values in N U {oo} and reproduction law p/ (of course this process is 
absorbed at oo as soon as an individual has an infinite offspring), then we recognize ([2]) as the 
equation for the probability of extinction. 

In particular, the well-known formula of Dwass [1] for the distribution of the total population 
size yields the semi-explicit formula 



oo _ 



1 n 

n=l 



where p*j = p/*. . -*P/ denotes the j-th convolution power of p/. Unfortunately this expression 
seems too complicated to be of some use in practice. 

4. It is natural to view H(f) as a Hamiltonian, C(f) as a partition function, and introduce 
the Gibbs measure 

e -H(f) 

Q = ^^P, 
£(/) ' 

where P denotes the law of the Galton- Watson process with reproduction law p and started from 
a single ancestor. Using (T2]) and for instance Otter's formula (see, e.g. Formula (304) in Pitman 
[8j), it is easily seen that Q is again the law of a Galton- Watson process with reproduction law 
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given by 

q(n) = £(/) n - 1 e- / ( n )p(ri) , neN. 

In particular, it is now elementary to compute the relative entropy Z)(Q||P) of Q with respect 
to P. Indeed, excluding implicitly the trivial case / = 0, Wald's lemma yields 

E Q (fr(/)) = <q ) />E Q (r 1 ), 

with (q, /) = Xl^Lo /( n )?( n )' an d it is a standard fact for sub-critical branching processes that 

oo 

i/Eq(t x ) = i - Eq(Xi) = i - £>?(n) ■ 

n=0 

5. It would be interesting to obtain an analogue of Theorem [2] in the case 1 < a < 2. As a 
first modest step, one should probably try to understand the impact of the maximal offspring 
XI on the total population T k . 
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